# Visual feature extraction
## Comp SigLIP So400M
CoMP-MM-1B is a visual foundation model (VFM) that accepts images at their native resolution. It was continually pre-trained from SigLIP.

License: Apache-2.0 | Tags: Multimodal Fusion | Author: SliMM-X | Downloads: 33 | Likes: 1
## Sam2 Hiera Large.fb R1024 2pt1
A SAM2 model built on the HieraDet image encoder, focused on efficient image feature extraction.

License: Apache-2.0 | Tags: Image Segmentation, Transformers | Author: timm | Downloads: 31 | Likes: 0
## Sam2 Hiera Large.fb R1024
A SAM2 model packaged for the timm library. It contains only the HieraDet image encoder, which makes it suitable for image feature extraction tasks.

License: Apache-2.0 | Tags: Image Segmentation, Transformers | Author: timm | Downloads: 747 | Likes: 0
## Dino Vits16
A self-supervised Vision Transformer (ViT) model trained with the DINO method, suitable for image feature extraction.

License: Apache-2.0 | Tags: Image Classification, Transformers | Author: facebook | Downloads: 47.32k | Likes: 16